^ . SEQUENCE LISTING 

(1) GENERAL INFORMATION 
(i) APPLICANT: CATCHESIDE, DAVID E. 

(ii) TITLE OF THE INVENTION: REAGENTS AND METHODS FOR DIVERSIFICATION 

OF DNA 

(iii) NUMBER OF SEQUENCES : 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Merchant, Gould, Smith, Edell, Welter & Schmidt 

(B) STREET: 3100 Norwest Center, 90 South 7th Street 

(C) CITY: Minneapolis 

(D) STATE: MN 
{E) COUNTRY: USA 
(F) ZIP: 55402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/977,171 

(B) FILING DATE: 24-NOV-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Skoog, Mark T 

(B) REGISTRATION NUMBER: 40,178 

(C) REFERENCE/DOCKET NUMBER: 10552. 13US01 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 612-332-5300 

(B) TELEFAX: 612-332-9081 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GATCGCAACT GGAGATCACT CGCACCGTGC CGCAGAACAA GGGCGACGAG CCTCAGGGCA 60 

GTTTAGCCTG CCGTAACAGC ACAGACCATA GCTTATTTTC ACCTGGGCGG GCGGGCGACG 120 

GCGGCACTGA CATCGGCAAG GCGGCATCAA GCAACCCCTC TGTTGCTTGC CAGCTGCCGG 180 

CCAACGTCAG CGGTACAAGG AGAAATCTGG AAGGAAAGAC TTCTGGCACC GACAGGATGG 240 

CACGCGGGAA AAGTTCCCAA TGCATGAGAT GAGGGGCATT TGCATTGCCT CCCGTCACAC 300 

TGCCCGCGAA CCCCAACCCC ACCATAGCGT CTGTCGATAC ATGGAGCGCG AAGTCGAGAA 360 

ACCTGTAATT CCTGGTAACT TTCAGGTACA CAGTACGTAC TGATCCTGGT ATCAAACCTT 420 

GCCTGCCGAG TTTTCGACGG AAAGAGGTGT GAATTGTGAA AGAGTCATAC CAAATCACCC 480 

GATTTTCATA AAGCCCGAGT CTTTTCTGTA CATAAGCGAC ACTCGAAGCG GGCCTCATCT 54 0 

TCATAGCCTG ATAGCTTGTA ATACTCCATC CTCGTATCTC ACTTGACCTT GAGTTCAACC 600 

CCACGTCAGA CTTCACCCGA CACATCGACG GATTGGGGAA CAGCACAATA CCTGAAAAGC 660 

GAGAAAACCA AACAGAGGAA AACACCATGG AGACAACACT TCCCCTCCCC TTCCTCGTCG 720 

GTGTCAGTGT TCCTCCCGGA CTGAATGACA TCAAGGAGGG CCTCAGCCGG GAGGAAGTCT 780 

CGTGTCTTGG CTGCGTCTTC TTCGAGGTCA AGCCCAAGAC CCTTGAGAAA ATCGTGCGAT 84 0 

TCCTCAAGCG TCACAATGTC GAATTTGAGC CCTACTTCGA TGTAACAGCC CTCGAGTCTA 900 

TCGATGATAT TATCACTCTT CTGGACGCCG GCGCCCGCAA GGTGTTTGTC AAGACCGAGC 960 

AGTTGGCCGA CCTCTCCGCA TATGGCTCCC GCGTTGCCCC CATTGTCACT GGAAGCAGCG 1020 

CTGCTTTGCT TTCCTCCGCC ACCGAGAGCG GCCTTTTGCT CTCCGGCTTC GATCAGACTG 1080 

CCTCCGAGGC TGCACAGTTT CTGGAGGAGG CCAGAGACAA GAAAATTACC CCCTTCTTCA 1140 

TCAAGCCCGT TCCTGGGGCC GATCTCGAAC AGTTCATCCA GGTCGCCGCC AAGGCTAACG 1200 

CCATCCCCAT CCTGCCATCC ACTGGCTTGA CAACAAAGAA GGACGAGGCC GGAAAGCTTG 1260 

CCATCTCCAC CATCCTCTCG AGCGTCTGGA AGTCTGACCG TCCCGATGGT CTGCTCCCCA 1320 

CCGTTGTCGT TGATGAGCAC GACACTGCTC TGGGTCTGGT CTACAGCAGT GCCGAGAGTG 1380 

TGAACGAGGC CCTCAGGACA CAGACTGGTG TCTATCAGAG CCGGAAGCGC GGTCTCTGGT 1440 

ACAAGGGTGC TACTTCCGGA GACACTCAGG AGCTCGTCCG CATCTCGCTT GACTGCGATA 1500 

ACGATGCTCT CAAGTTTGTC GTGAAGCAGA AGGGTCGTTT CTGCCACCTC GATCAGTCCG 1560 

GCTGCTTTGG TCAGCTCAAA GGCCTTCCCA AGCTCGAGCA GACTTTGATT TCGAGGAAAC 1620 

AGTCTGCCCC CGAGGGCTCC TACACTGCCC GTCTCTTCTC CGATGAGAAG CTAGTCCGGG 1680 

CCAAGATCAT GGAGGAGGCT GAGGAGCTCT GCACCGCTCA GACCCCCCAG GAAATCGCCT 1740 

TTGAGGCTGC CGATCTCTTC TACTTTGCTC TTACCAGGGC CGTTGCTGCC GGCGTTACTC 1800 

TTGCCGATAT CGAAAGGAGC CTTGACGCCA AGAGCTGGAA GGTCAAGCGC AGGACTGGAG 1860 

ATGCTAAGGG TAAGTGGGCT GAGAAGGAGG GCATCAAGCC TGCGGCGTCC GCTCCCGCTG 1920 

CCACTTCGGC CCCTGTCACC AAGGAGGCCG CCCAGGAGAC CACCCCTGAG AAGATCACCA 1980 

TGAGACGTTT CGACGCCTCC AAGGTCTCTA CCGAGGAGCT CGATGCTGCT CTCAAGCGTC 2040 

CTGCGCAAAA GTCGTCCGAT GCCATCTACA AGATCATTGT CCCCATCATC GAGGACGTCC 2100 

GCAAGAACGG CGACAAGGCT GTTCTGTCGT ACACTCACAA GTTCGAGAAG GCTACCTCTC 2160 

TTACTAGCCC CGTCCTGAAG GCGCCCTTCC CCAAGGAGCT TATGCAGCTC CCTGAGGAGA 2220 

CCATTGCTGC CATCGACGTG TCCTTCGAGA ACATCCGCAA GTTCCACGCC GCCCAGAAGG 2280 

AGGAGAAGCC CCTCCAGGTC GAGACCATGC CCGGTGTTGT CTGCAGCCGT TTCTCTCGTC 2340 

CCATCGAGGC CGTCGGCTGC TACATCCCCG GCGGTACCGC CGTTCTCCCC AGCACTGCCC 2400 

TTATGCTGGG TGTTCCCGCC ATGGTCGCCG GCTGCAACAA GATTGTGTTC GCCTCTCCTC 24 60 

CCCGCGCCGA CGGAACCATC ACTCCCGAGA TTGTCCACGT CGCTCACAAG GTTGGGGCCG 2520 



AGTCCATCGT GCTTGCCGGC GGTGCCCAGG CCGTAGCTGC CATGGCCTAC GGCACCGAGA 2580 

GCATCACCAA GGTCGACAAG ATTCTCGGCC CCGGTAACCA GTTCGTCACT GCTGCCAAGA 2640 

TGTTCGTCAG CAACGACACC AACGCTGCCG TTGGGATTGA CATGCCCGCT GGCCCGTCCG 2700 

AGGTGCTGGT CATCGCTGAC AAGGACGCCA ACCCCGCGTT CGTTGCCTCG GATCTCCTGT 2 760 

CCCAGGCTGA GCACGGCGTT GACAGTCAGG TCATCCTGAT CGCTATTAAC CTCGACGAGG 2 820 

AGCATCTTCA GGCTATTGAG GACGAGGTTC ACCGTCAGGC TATGGAGCTT CCTCGCGTCC 2880 

AGATTGTCCG TGGCTCCATC GCCCACTCGA TCACCGTGCA GGTCAAGACC GTCGAGGAGG 2 940 

CCATGGAGCT CAGCAACAAG TACGCTCCTG AGCACTTGAT CCTCCAGATC AAGGAGGCCG 3 000 

AGAAAGCTGT CGATCTTGTC ATGAACGCTG GTAGTGTCTT CATTGGCGCT TGGACTCCTG 3 060 

AGTCCGTTGG CGATTACTCT GCTGGTGTTA ACCACTCGCT GCGTAAGTTA CATATCATAA 3120 

ATAGCCCCGC TTCACAGATT CTTCTGCTAA CGTCAAGACA CATAGCTACC TATGGTTTTG 3180 

■ GCAAGCAGTA CTCTGGCGTC AATCTCGCCT CGTTCGTCAA GCACATTACC AGCTCCAACT 324 0 

TGACTGCCGA GGGTCTCAAA AACGTCGGCC AGGCTGTCAT GCAGTTGGCT AAGGTTGAGG 3300 

AGCTCGAGGC TCACAGAAGG GCGGTCAGCA TCCGTCTTGA GCACATGAGC AAGAGCAACT 3 3 60 

AGACGGAAAT TCTTTTTCGA AGTTGCAAAA AAAACAAGAA CAAAAGGATG TAGTGGGTTG 3420 

ATGTATATCT GGGTCATTTT GGGCACATAG AGTAATGATA ACGAGTTTTG GACATTGTAC 34 80 

TGTTCTGTAC AGGCTGAAGA TCAGTACATG AATCTGTTGG TAAGTGTAGA GACCCAAACG 3 54 0 

TCCCTTGAGT TTTTCTCCCT GTTCCAGAGA GGTGCTCGTC CCTGGGTGTT TATTTTCATT 3600 

ATTACATCAA CCTTTTATTT TATTTTATTT TTTATTTTAC TTTTTTTTCC TTTTTTTCAG 3660 

ATCATGCGTA CATGAACGGG GGAAGCACAG ACGATCGAAA CGTGGATGTC ACAATGTCGC 3720 

TGCAGTGATG CTGCATTGCA TGAAGCGCCC ATCTCAATAT ACTTGCAGTC TTGCGCGTTG 3 780 

CACGTGAACT TCCCAAACAA CCGAATAAAA GACGGCGAAA AATGAAGATA AAAAAAAACC 3 840 

ATAATAAAAA TCGGAGGGAG TGTGGGAAAT GGTTTCTTTT AGCATTTAGA CCCCATAGCC 3 900 

GTGCACGCCC GGGTACAGAC AGGTTCATCG ATGTTGACAT TGACTGGGAC ACCAGGTCTA 3 960 

TCTATTTCAT CTCCTGTCCT CTACCATACA TCGGGACATC GGACATCTCG CTGTACCCCC 4 020 

CACACCCACA AAGTCTTATA AAAGCGCCAC ACCCGAGGAG GTTCGGTCGG CCCCACGAAC 4 080 

TCCGTGCCTC CCTGGGTGTT TACAGGGACC GAACGCTGGA GAAGCTTAGT TTCCTGACAT 4140 

CCGGCCTACC CGAGCAGGAA AAGGGACAGC TCATAGGCGA GGAGGGATTT GAAGATGGGG 4200 

ACATTTTGGA TGATTCGAGA GGAGGAACTA GGTACTGTAT CATGATAGTT CGGGGCAGCA 4260 

TCTTGGCTGG GACATTGTTA ATACCTCGAT ATGATGAAGT GGGAGGGAGT TTTTTCATGT 4320 

CTTGCCCAAG TCCCACTAAT CTTTTTTTTT TTTTGTACCA ACACCCAAGA TTCGGAGAAT 4380 

AGTGTAAGGA TTCGCATTCA CAAGTGGAAG TCTGAGGATC TTTTTATATC TTTGTCTTCC 4440 

GCGGACTGTT AACGATCCTA CAGCGAGCGA GCGAGCGGTC GGATGCGCTG ATCTGATAGG 4 500 

TGCAATATAC GGCCGCTTTC TCCGGTCGTG TAGTGTAAGC TCTGTCGGCA TAGTAGTACA 4560 

CTAAAAAAAC CCTTGCATTT CATGATCTGC TTGCTATTCA TTCCGAGTTA TTTCAGTGGT 4 620 

CACATTTCGA GATTCACAGC CATCCATCCA TATGGAAAAA TCCATTCCCA TGCTTCCTCC 4680 

CCCCCACTAT GTATGTGACC ACACGCTGCT GTCAGAATGC CAACGGTCTC AGGTACCCTC 4740 

GTCCGACTGT TTGGCATGGA GTTACATACA CTACTAGTGT AGCCCCGGGC CAAGCTACCC 4800 

CGTCAAATCT ATACATATCT ATAATGGGTT TCAGGTGTTT CGTTCGCTGT CAATCAAGTT 4 860 

TGAAACATCA CTGGGGCCGT TGGACGGTGT ATTAGACCAT TGGCTCCCTC AGCTGGCGGC 4920 

TGGGCGGTTG GGTCGGCAAT AACGGGACTG GACTTGAGAG GGACGAGGAG AGTCGGTTGG 4980 

CTGCCTACAC TACACTACAA GCGTTCCCAC CTAACCGACG AGTCCCGTTT TCCATTTGTG 5040 

TGCCTTAACC ATCATCTAGG GATGTCAGGG TTTGGCCGGA TCAGGGTATG TTTGGTTGAC 5100 

TGTTGTCATG TCTGATTGGG TACATATCAT GGTAGGTGTC TCGAGAACAG TAGAGTACTC 5160 

GGGCCTAGCG TTTGGATGAT TACGCGAGAT ATGAGTTGTA GGCCGCCATG CAGTTGCTTG 5220 

CCCATAAGCA GAAGTTGCTT TGGGATATAT TTCTCGTCTT TCAAAGGTCA CGAGGTCCTG 5280 

GGACGAGCGG CATCGCCATC CAAAGGGTTG AACATGAGAA ACCGGAATGG CCTTTGCGTT 534 0 

GAAATACAAA AAGTCAAGAA TAAAATCGCT TGAGGATAGG GACGTGGAAG CAAGCAAATA 54 00 

TGGTAAGGGA GGTACTGCTA TGTAGGTGCT CAGCAAACTG CCAATTTCTT GGCCCCCAAG 5460 

CAGCAGTTTG CTGTCAGTGC TGCTCGTGTC AGCCTTGGTA GTGGAACCTA AACTGCTAAC 5520 

ACAGCGCAAG TGCGCATGTA AAGATATTGT GGGAGGATCT GTATGGATGG ATGAGATTAC 5580 

TGCTTGGTGT TGGTTGCGAG GCACTGCGGC TGTTAGGCTT TGCTGTGCCC CGTTCGACGA 5640 

AGAAATACGC GGAACTATAA ATTGGATACC TAGACTTACT GCCTATGGGA GGTATCTACC 5700 

GACGTAGCCG ACGGATTCTA GCAACATCCC GACTTTGCTT GTAGTGTACT ATGATAGCAG 576 0 
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CACAGTGGGG TGTTGCTCCT TGTGAGCATG GGCTCTTTTT TTTTTTTTCC CCCTTCCCTA 5820 

GGGCGTTGAC TGGACTTGCT CTATCGTTCC CAAGGTAGGT GCCCGTCATC GATTTTCCCA 5880 

AGCCGTCTCC CGCCAGATTG TCGTCATAGT GTCATGATGA CCTCGGTCGC TGGGGCTGCG 594 0 

TGGTTACGGG GAGCTGGGAC CGCTAGGCCT CAGTGGTTGT GCCATTCAGC GTGGGTGTGT 6 000 

GGAGTAGCGG TAGAGGCGCT TGGAAGTTGT GCTAGCGGAA ACCCTGGAAT ATCTTGTACC 6 060 

CTTCGATTCC TTCTCGGGCT GCCCATGTGC TGAGGTGATG CCGGGGATCT GGCGCCAATC 612 0 

ATCCATTGAG GTTCCCGCAG CTTCCCGGTG CCGCGCGCGG GCGCAGTTGC TCACAGGACA 618 0 

CACCTAGACG CAGGGGCACA GGGGCACCGT TTGGTGTGCA ACTGGGTACC TGGTAGCTGT 624 0 

AGCAAGCACT CCACCGTCTG TGCAATCCCC CAATCCACGG CAGGAACTTA GCACCGCCGC 63 00 

GGCACCGAGT GAGCGAATCC ATCCGCATTG GATCCCAATT CTTGCCCTTG CCATCCTTCT 63 60 

TTCTTCCCAC TTGGCGCAAC CAACACTTCC CTTGGTCTGG GTACTCGTGT TGATCTTCAC 6420 

TCTCTTTTTT TCTTGGGCGA CCGACTTTTT ATATCCGTCC TTGCTTCCCC CTGGCCGTTG 64 80 

TCGTTCTTTC TACAACTACC TTCCGTTCAT TATCCCCTTT CTTGGTTCGG TCGAGGACCC 654 0 

AAAAACAGAA CAATTCCGGC TCTTCCAGGT GGCTTGGGTG CGACTGTTTA GCTCTTGACC 66 00 

ACTAGCCGCT TACCTTCTCT TGATGTTTAT ATTTGGATAT CATTGAACTA CTCTTTCTTG 6660 

AAACGGCAGA CGAACGGAAC AGTCCCTACG GTTTATTAGC GATATACGTT GTACTGATAT 672 0 

CCTGAGCAAG AAGAGGCAAA TTATCAATTA TGCATCTCCC ATCGTCGCTG CTCATCGCAG 6780 

CTCCCTTGCT CGCCAATGTA TCGGCCGAAC CGATTAGGAT ACCCCAACGC GATGTTCTCC 6840 

GTGGTATCAA CATCACAGCA ACTTGCCGTT CGAGCACTAC CGAATTCGCC CAGCGGTGGA 6900 

TATGCCCCTG CCGTTGTAGA CTGTCCCAAG ACCAAGCCGA CGCTCCGGAA GGCCGTGGAT 6960 

TTGTCGAACG AGGAGAAGAA CTGGTTGTCG ATCCGGAGGA AGAACACCAT CCAGCCCATG 7020 

AGGGACCTAC TGAAGAGGGC CAACATCACT GGGTTCGATT CCGAAACTTT CATGAATGAG 7080 

GCCGCCAACA ACGTCTCGCA ACTGCCCAAT GTCGCCATTG CCATTTCAGG AGGCGGCTAT 714 0 

CGTGCCCTCA TGAACGGCGC CGGCTTCGTT GCTGCTGCGG ATAACCGGAT TCAAAATACC 7200 

ACGGGCGCAG GTGGTATTGG AGGCTTGTTG CAGTCCAGCA CATATTTGTA TGTAAAACCA 7260 

TGCCTTCTTG TGGTTCTTCT TATCTCGTTT TCGAGTGTCA ACTGCGCCAG TTCGACGTTG 7320 

GGCGGCTGTG GACGACCTTG CTGGTGAACA TGTCTTGGAC TCCATGCCCC TTTTTTCGTT 73 80 

CCCTAAAATC CCAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAATTCGAG 7440 

GACCGTGACT GTAAATTGCT AACGCAACTC TAGGGCCGGA CTTTCTGGTG GTGGCTGGCT 7500 

TGTCGGCAGT TTGTTCTCCA ACAACTTCAG TAGCATTGAG ACCCTGCTGA GCGAGAACAA 7560 

AGTCTGGGAC TTTGAGAACT CCATCTTTAA AGGACCCAAG GAGGCTGGCC TTAGTACTGT 7620 

CAACCGTATC CAGTACTGGT CCGAAGTGGC AAAGGAAGTT GCGAAGAAGA AGGATGCTGG 7680 

CTTCGAGACA AGTATAACAG ACTACTGGGG CCGAGCATTG AGTTACCAAC TGATCGGAGC 7740 

CGATATGGGC GGCCCGGCTT ACACCTTCTC CAGCATTGCC CAGACCGACA ACTTCCAGAA 7800 

GGCCGAAACG CCGTTCCCTA TTCTGGTAGC TGACGGCCGC GCGCCTGGAG ACACCATCAT 7860 

CTCCCTCAAT GCTACCAACT ACGAGTTCAA CCCGTTCGAG ACGGGTAGCT GGGACCCGAC 7920 

CGTCTATGGC TTTGCGCCGA CCAAGTACCT CGGCGCCAAC TTCAGCAACG GCGTGATCCC 7980 

ATCGGGAGGC AAGTGCGTTG AGGGTCTCGA CCAAGCCGGC TTCGTCATGG GCACCAGCAG 8040 

CACGCTCTTC AACCAGTTCC TTTTGGCCAA CATCTCCAGC TACGACGGTG TTGCCAGACG 8100 

TGCTCATCGA GGCCGTGACT TCTGTCCTCA AGGAAATCGG CGCCAAGAGG ACGACGTCTC 8160 

CCAAATCATC CCTAATCCGT TCCTGGACTG GAACAACCGG ACCAACCCCA ACGCCGACAC 8220 

GCTCGAGCTC GACCTGGTCG ACGGCGGCGA AGATCTGCAG AATATTCCGC TCAACCCGCT 8280 

CACCCAACCC GTGCGCGCCG TCGACGTCAT CTTCGCTGTC GACTCGTCCG CCGACGTGAC 834 0 

AAACTGGCCC AATGGCACCG CCCTGCGCGC CACCTACGAG CGCACTTTCG GCTCTATTTC 84 00 

CAACGGGACA CTCTTCCCCT CGATCCCCGA CGACTGGACG TTTATAAACC TAGGCCTCAA 8460 

CAACCGCCCC TCTTTCTTCG GCTGCGATGT TAAGAACTTT ACCTTGAACG CCAACCAAAA 8520 

GGTTCCCCCC TTAATCGTCT ATGTCCCCAA CGCGCCCTAT ACCGCGCTGA GCAACGTGTC 8580 

CACCTTCGAT CCGTCATACA CGATGTCTCA GCGCAACGAC ATCATCGGCA ACGGATGGAA 8640 

CTCAGCCACG CAGGGAAACG GCACGCTGGA TTCGGAGTGG CCCACTTGCG TCGCCTGCGC 8700 

GGTTATCAGC AGGAGCTTAG ATCGGTTGGG CAGGCAGACG CCAGCCGCGT GCAAGACTTG 8760 

CTTTGACAGG TATTGCTGGA ATGGCACAGT GAACTCCAAA GATACGGGGG TTTACATGCC 8820 

TGAGTTCAAG ATTGCGGATG CGCATGCCCT GGACTCGGGT GCTGTTGCTA TCGGAAAGAT 8880 

GGTGAATGTC TGGTCGTCGG TTGTGGTGGG AGTTGTGGCG GCTACTTTGT TGTTGTAGGG 8 94 0 

GTAGGGGAGA CGTGATGATA TTCCAGTCTG ATGAAGTTGA GACTGGACTG GAGATCGCCA 9000 
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AGGATGCGGA GGGAAAGGAA TGCGTGGTGT TAATGTCATG ATGGATGAAG AGTCATGGAT 906 0 

CATGGAACGA CGGGGCGGGG ATATTGGATG ATGGATATAC CACACTGCAT GCATGCTCTA 9120 

TTGATAGTAT GCTTTGGCAT TTACGTTTAA CAATCAATTG CTCCATCCTG ATGTTCTATC 9180 

TTTTTCGACA ATGGATTGAT ACTACTCCTG TTGCTTCGCT CTTGAGGTTG GAAGGACTTG 9240 

AGGTTGGAAG GACTTGAGGT TGTTTGTTCT GAGGGAGGTT ATCGAAGTAT CATCTGTGCT 93 00 

GATGCCGATT GATAGACTGT CCTCTTCTTC GAGGCAACGA ACGGTCGGAT GAGCCTCTTT 93 60 

AATCATGATG CTCAGTGCCA CAAAAAGGCT CCAGCACAGC TGCCCACACC TTTCTTGCCT 9420 

CGCCGTTCCT TCCTTTTTCT TTTCCCCTGT TTCCTTTCTT CCTTTCCATC TCATCCCGTA 9480 

CCAGAGTGCC CACCGGGTAT ATATATTACC TCCTTGGCCG TTCTCCTTTG ACCAATAAAT 954 0 

CGCTTGGTCG AGTGGCGTAA CGGTTTACCG TCTACACTTA TCACTCAAAC CAAACCAAAC 9600 

CATCGAAGAA GTGACCTATC GGTTCGAGGG AACGGTGATG TTCTTACGAC CAAGTTAACC 9660 

CAAAGAGCGT TCCACATCGT TGAACCGTCT CCTCCAGTTG GATCTGTTTA ACTTCCGCAG 9720 

CGACTGAAGA AGGTATCACT TTTTTTTTGG TTCCAAAAAA AAAAAAAAAA ATTAC 9775 
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(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9934 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

ACCGGGAATC GTAGCGGGCG CTAAGGCCAA GCCGCGGCAC GGGTCACTGA CCCAATGCAG 60 

CGCATTCCGT CAGCAACTGA AGTGGATGTA CAAGTACATA GTAGTAGATC GCAACTGGAG 120 

ATCACTCGCA CCGTGCCGCA GAACAAGGGC GACGAGCCTC AGGGCAGTTT AGCCTGCCGT 180 

AACAGCACAG ACCATAGCTT ATTTTCACCT GGGCGGGCGG GCGACGGCGG CACTGACATC 240 

GGCAAGGCGG CATCAAGCAA CCCCTCTGTT GCTTGCCAGC TGCCGGCCAA CGTCAGCGGT 300 

ACAAGGAGAA ATCTGGAAGG AAAGACTTCT GGCACCGACA GGATGGCACG CGGGAAAAGT 360 

TCCCAATGCA TGAGATGAGG GGCATTTGCA TTGCCTCCCG TCACCCAGTG CGAACCCCAA 420 

CCCCACCATA GCGTCTGTCG ATACATGGAG CGCGAAGTCG AGAAACCTGT AATTCCTGGT 480 

AACTTTCAGG TACACAGTAC GTACTGATCC TGGTATCAAA CCTTGCCTGC CGAGTTTTCG 540 

ACGGAAAGAG GTGTGAATTG TGAAAGAGTC ATACCAAATC ACCCGATTTT CATAAAGCCC 600 

GAGTCTTTTC TGTACATAAG CGACACTCGA AGCGGGCCTC ATCTTCATAG CCTGATAGCT 66 0 

TGTAATACTC CATCCTCGTA TCTCACTTGA CCTTGAGTTC AACCCCACGT CAAACTTCAC 72 0 

CCGACACATC GACGGATTGG GGAACAGCAC AATACCTGAA AAGCGAGAAA ACCAAACAGA 780 

GGAAAACACC ATGGAGACAA CACTTCCCCT CCCCTTCCTC GTCGGTGTCA GTGTTCCTCC 84 0 

CGGACTGAAT GACATCAAGG AGGGCCTCAG CCGGGAGGAA GTCTCGTGTC TTGGCTGCGT 900 

CTTCTTCGAG GTCAAGCCCC AGACCCTTGA GAAAATCCTG CGATTCCTCA AGCGTCACAA 960 

TGTCGAATTT GAGCCCTACT TCGATGTAAC AGCCCTCGAG TCTATCGATG ATATTATCAC 102 0 

TCTTCTGGAC GCCGGCGCCC GCAAGGTGTT TGTCAAGACC GAGCAGTTGG CCGACCTCTC 1080 

CGCATATGGC TCCCGCGTTG CCCCCATTGT CACTGGAAGC AGCGCTGCTT TGCTTTCCTC 1140 

CGCCACCGAG AGCGGCCTTT TGCTCTCCGG CTTCGATCAG ACTGCCTCCG AGGCTGCACA 1200 

GTTTCTGGAG GAGGCCAGAG ACAAGAAAAT TACCCCCTTC TTCATCAAGC CCGTTCCTGG 1260 

GGCCGATCTC GAACAGTTCA TCCAGGTCGC CGCCAAGGCT AACGCCATCC CCATCCTGCC 1320 

ATCCACTGGC TTGACAACAA AGAAGGACGA GGCCGGCAAG CTTGCCATCT CCACCATCCT 1380 

CTCGAGCGTC TGGAAGTCTG ACCGTCCCGA TGGTCTTCTC CCCACCGTTG TCGTTGATGA 1440 

GCACGACACT GCTCTGGGTC TGGTCTACAG CAGTGCCGAG AGTGTGAACG AGGGCCTCAG 1500 

GACACAGACT GGTGTCTATC AGAGCCGGAA GCGCGGTCTC TGGTACAAGG GTGCTACTTC 1560 

CGGAGACACT CAGGAGCTCG TCCGCATCTC GCTTGACTGC GATAACGATG CTCTCAAGTT 1620 

TGTCGTGAAG CAGAAGGGTC GTTTCTGCCA CCTCGATCAG TCCGGCTGCT TTGGTCAGCT 1680 

CAAAGGCCTT CCCAAGCTCG AGCAGACTTT GATTTCGAGG AAACAGTCTG CCCCCGAGGG 1740 

CTCCTACACT GCCCGTCTCT TCTCCGATGA GAAGCTAGTC CGGGCCAAGA TCATGGAGGA 1800 

GGCTGAGGAG CTCTGCACCG CTCAGACCCC CCAGGAAATC GCCTTTGAGG CTGCCGATCT 1860 

CTTCTACTTT GCTCTTACCA GGGCCGTTGC TGCCGGCGTT ACTCTTGCCG ATATCGAAAG 1920 

GAGCCTTGAC GCCAAGAGCT GGAAGGTCAA GCGCAGGACT GGAGATGCTA AGGGTAAGTG 198 0 

GGCTGAGAAG GAGGGCATCA AGCCTGCGGC GTCCGCTCTC GCTGCCACTT CGGCCCCTGT 204 0 

CACCAAGGAG GCCGCCCAGG AGACCACCCC TGAGAAGATC ACCATGAGAC GTTTCGACGC 2100 

CTCCAAGGTC TCTACCGAGG AGCTCGATGC TGCTCTCAAG CGTCCTGCGC AAAAGTCGTC 216 0 

CGATGCCATC TACAAGATCA TTGTCCCCAT CATCGAGGAC GTCCGCAAGA ACGGCGACAA 222 0 

GGCTGTTCTG TCGTACACTC ACAAGTTCGA GAAGGCTACC TCTCTTACTA GCCCCGTCCT 2280 

GAAGGCGCCC TTCCCCAAGG AGCTTATGCA GCTCCCTGAG GAGACCATTG CTGCCATCGA 234 0 

CGTGTCCTTC GAGAACATCC GCAAGTTCCA CGCCGCCCAG AAGGAGGAGA AGCCCCTCCA 2400 

GGTCGAGACC ATGCCCGGTG TTGTCTGCAG CCGTTTCTCT CGTCCCATCG AGGCCGTCGG 2460 

CTGCTACATC CCCGGCGGTA CCGCCGTTCT CCCCAGCACT GCCCTTATGC TGGGTGTTCC 2520 
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CGCCATGGTC GCCGGCTGCA ACAAGATTGT GTTCGCCTCT CCTCCCCGCG CCGACGGAAC 2580 

CATCACTCCC GAGATTGTCC ACGTCGCTCA CAAGGTTGGG GCCGAGTCCA TCGTGCTTGC 264 0 

CGGCGGTGCC CAGGCCGTAG .CTGCCATGGC CTACGGCACC GAGAGCATCA CCAAGGTCGA 2700 

CAAGATTCTC GGCCCCC^GTA ACCAGTTCGT CACTGCTGCC AAGATGTTCG TCAGCAACGA 2760 

CACCAACGCT GCCGTTGGTA TTGACATGCC CGCTGGCCCG TCCGAGGTGC TGGTCATCGC 2 820 

TGACAAGGAC GCCAACCCCG CGTTCGTTGC CTCGGATCTC CTGTCCCAGG CTGAGCACGG 2 880 

CGTTGACAGT CAGGTCATCC TGATCGCTAT TGACCTCGAC GAGGAGCATC TTCAGGCTAT 2 94 0 

TGAGGACGAG GTTCACCGTC AGGCTACGGA GCTTCCTCGC GTCCAGATTG TCCGTGGCTC 3 000 

CATCGCCCAC TCGATCACCG TGCAGGTCAA GACCGTCGAG GAGGCCATGG AGCTCAGCAA 3060 

CAAGTACGCT CCTGAGCACT TGATCCTCCA GATCAAGGAG GCCGAGAAGG CTGTCGATCT 3120 

TGTCATGAAC GCCGGTAGTG TCTTCATTGG CGCCTGGACT CCTGAGTCCG TTGGCGATTA 3180 

CTCTGCTGGT GTTAACCACT CGCTGCGTAA GTTACATATC ATAAATAGCC CCGCTTCACA 3240 

GATTCTTCTG CTAACGTCAA GACACATAGC TACCTATGGC TTTGGCAAGC AGTACTCTGG 33 00 

CGTCAATTTC GCCTCGTTCG TCAAGCACAT TACCAGCTCC AACTTGACTG CCGAGGGTCT 33 60 

CAAAAACGTC GGCCAGGCTG TCATGCAGTT GGCTAAGGTT GAGGAGCTCG AGGCTCACAG 3420 

AAGGGCGGTC AGCATCCGTC TTGAGCACAT GAGCAAGAGC AACTAAACGG AAATTCTTTT 3480 

CGAAGTAGCA AAAAAAAAAA AAAAAAACAA GAACAAAAGG ATGTAGTGGG TTGATGTATA 3540 

TCTGGGTCAT TTTGGGCACA TAGAGTAATG ATAACGAGTT TTGGACATTG TACTGTTCTG 3600 

TACAGGCTGA AGATCAGTAC ATGAATCTGT TGGTAAGTGT GGAGACCCAA ACGTCCCTTG 3660 

AGTTTTTCTC CCTATTCCAG AGGTGCTCGT CCCTGGGTGT TTATTTTCAT TATTACATCA 3720 

ACCTTTTTTT tTTTTTTTTT TTTTTCAGAT CATGCGTACA TGAACGGGGG AAGCACAGAC 3780 

GATCGAAACG TGGATGTCAC AATGTCGCTG CAGTGATGCT GCATTGCATG AAGCGCCCAT 3840 

CTCAATATAC TTGCAGTCTT GCACGTTGCA TGTGAACTTC CCAAACAACC GAATAAAAGA 3900 

CGGCGAAAAA TGAAGATAAA AAAAAACCAT AAAAAAAATC AGAGGGAGTG TGGGAAATGG 3 960 

TGTCTTTTAG CATTCAGACC CCATAGCCGT GCACGCCCGG GTACAGACAG GTTCATCGAT 4020 

GTTGACATTG ACTGGGACAC CAGGTCTATC TATTTTATCT CCTGTCCTCT ACCATACATC 4080 

GGGACATCGG ACATCTTGCT GTACCCCCCA CACCCACAAA GCCTTATAAA AGCGCCACAC 4140 

CCGAGGAGGT TCGGTCGGCC CCACGAACTC TGTGCCTCCC TGCCTGTTTA CAGGGACCGA 4200 

ACGCTGGAGA ATCTTACTAG TTTCCTGACA TCCGGCCTAC CCGAGCAGGA AAAGGGACAG 4260 

CTCATAGGCG AGGAGGGATT TGAAGATGGG AACATTTTGG GTGATTCGAG AGGAGGAACT 4320 

AGGTACTGCA TCATGATAGT TCGGGGCAGC ATCTTGGCTG GGACATTGTT AATACCTCGA 4380 

TATGATGAAG TAGGAGGGAG TTTTTGCGTG TCTTGCCGAA GTCCAGAGAT CTGTTTTATT 4440 

TTATTTTTTA TGGATGTAGT GTATCAACAC CCAAGATTCG GAGAATAGTA CTAGGATTCG 4500 

CATTTACAAG TGGAAGTCTT GAGAATCGTT GTATATCCTT GTCTTCCTCG GAATGTTAAC 4560 

AATCCTACAG CGAGCGAGCG AGCGGTCGGA TGCGCTGATC TGATAGGCGC AATATACGGC 4620 

CGCTTTCTCC GGTCGTGTAG TGTAAGCTCT GTGGGCATAG TACACTAAAA AAACCCTTGC 4680 

ATTTCATGAT CTGCCTGCTA TTCATTCCGA GCTATTTCAG TGGTCACATT TCGAGGAAGA 4740 

AAGAAAGCAA CTAAGATTCA CAGCCATCCA TCCATCCATA TGGAAGAATA ATCCATTCCC 4800 

ATGTTCCCTC CCCCCCACTA TGTATGTGAC CACACGCTGC TGTCAGAATG CCAACGGTCT 4860 

CAGGTACCCT CGTCCGACTG TTTGGCATGG AGTTACATAC ACTACTAGTG TAGCCCCGGG 4920 

CCAAGCTACC CCGTCAAATC TATACATATC TATAACGGGT TTCAGGGGTT TCGTTCGCTG 4980 

TCAATCAAGT TTGAAACATC ACTGGGGCCG TTGGACGGTG TATTAGACCA TTGGCTCCCT 5040 

CAGCTGTTTG GCGGCTGGGC GGCTGGGTCA AACGGCAATA ACGGGACTCG AGAGGGACGA 5100 

GGAGAGTCGG TTGGCTGGCT GCAATACAAG CGTTCCCACC TAACCAACGA GTCCCGTTTT 5160 

CCATTTGTGT GCCTAACCAT CATCTAGGGA TGTCAGGGTT TGGCCGGATC AGGGTATGTT 5220 

TGGTTGACTG TTGTCATGTC TGATTGGGTA CATATTATGG TAGGTGTCTC GAGAACAGTA 5280 

GAGTACTCGG GCCTAGCGTT TGGATGATTA CGCGAGATAT GAGTTGTGGG CCGCCATGCA 5340 

GTTGCTTGTC CATAAGCAGA AGTTGCTTTG GGATATATTT CTCGTCTTTC AAAGGTCACG 54 00 

AGGTCCTGGG ACGAACGGCA TCGCCATCCA AAGGGTTGAA CATGAGAAAC CTGAATGGCC 5460 

TTTGCGTTGA AATACAAAAA GTCAAGAACA AAATCGCTTG AGGATAGGGA CGTGGAAGCA 5520 

AGCAAATATG GTAAGAGAGG TATACATCAA CCCTGGTTCA ATTGTTAGCG TGGTTCTTCC 5580 

TCCACGTCCT CGTTCATGAC GGTTAACAGT ACCAGGCTAA CAATTAAACC AGGGTTGATG 564 0 

TGTACTGATA TGTAGGTGCT CAGCAAACTG CCAATTTCTT TGGCCCCAAG CAGCAGTTTG 57 00 

CTGTCAGTGC TGCTCGTGTC AGCCTTGGTA GTGGAACCTA AACTGCTAAC ACAGCGCAAG 57 60 
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TGCGCATGTA AAGATATTGt GGGAGGATCT GTATGGATGG ATGAGATTAC TGCTTGGTGT 582 0 

TGGTTGCGAG GCACTGCGGC TGTTAGGCTT TGCTGTGCCC CGTTCGACGA AGAAATACGC 5880 

GGAACTATAA ATTGGATACC TAGACTTACT GCCTATGGGA GGTATCTACC GACGTAGCCG 5 94 0 

ACGGATTCTA GCAACAT'CCC GACTTTGCTT GTAGTGTACT ATGATAGCAG CACAGTGTTG 6000 

CTCCTTGTGA GAATGGGCTC tTTTTTTTTT TCCCCCTTCC CTAGGGCGTT GACTGGACTT 6060 

GCTCTATTGT TCCCAAGGTA GGTGCCCGTC ATCGATTTTC CCAAGTCTCC CGCCAGATTG 6120 

TCGTCATAGT GTCATGATGA CCTCGGTCGC TGGGGCTGCG TGGTTACGGG GAGCTGGGAC 6180 

CGCTAGGCCT CAGTGGTTGT GCCATTCAGC GTGGGTGTGT GGAGTAGCGG TAGAGGCGCT 624 0 

TGGAAGTTGT GCTAGCGGAA ACCCTGGAAT ATCTTCTACC CTCGATTCCT TCTCGGGCTG 63 00 

CCCATGTGCT GAGGTGATGC CGGGGATCTG GCGCCAATCA TCCATTGAGG TTCCCGCAGC 63 60 

TTCCCGGTGC CGCGCGCGGG CGCAGTTGCT CACAGGACAC ACCTAGACGC AGGGGCACAG 6420 

GGGCACCGTT TGGTGTGCAA CTGGGTACCT AGCTGTAGCA AGCACTCCAC CGTCTGTGCA 6480 

ATCCCCCAAT CCACGGCAGG AACTTCGCAC CGCCGCGGCA CCGAGTGAGC GAATCCATCC 654 0 

GCATTGGATC CCAATTCTTG CCCTTGCCAT CCTTCTTTCT TCCCACTTGG CGCAACCAAC 6600 

ACTTCCCTTG GTCTGGGTAC TCGTGTTGAT CTTCACTCTC TTTTTTTCTT GGGCGACCGA 6660 

CTTTTTATAT CCGTCCTTGC TTCCCCCTGG CCGTTGTCGT TCTTTCTACA ACTACCTTCC 6720 

GTTCATTATC CCCTTTCTTG GTTCGGTCGA GGACCCAAAA ACAGAACAAT TCCGGCTCTT 6780 

CCAGGTGGCT TGGGTGCGAC TGTTTAGCTC TTGACCACTA GCCGCTTACC TTCTCTTGAT 684 0 

GTTTTTATTT GGATATCATT AAACTACTCT TTCTTGAAAC GGCAGACGAA CGGAACAGTT 6900 

CCTACGGTAT ATTAGCGATA TACGTTGTAC TGATATTCTG AGCAAGAAGA GGCAAATTAT 6960 

CAATTATGCA TCTCCCTTCG TCGCTGCTCA TCGCAGCTCC CTTGCTCGCC AATGTATCGG 7020 

CCGAACCCAT TAGGATACCC CAACGCGATG TTCTCCGTGG TATCAAGATC ACAGCAACTT 7080 

GCCGTTCGAG CACTACCGGA TTCGCCCAGC GGTGGATATG CCCCTGCCGT TGTAGACTGT 7140 

CCCAAGACCA AGCCGACGCT CCGGAAGGCC GTGGATTTGT CGAACGAGGA GAAGAACTGG 7200 

TTGTCGATCC GGAGGAAGAA CACCATCCAG CCCATGAGGG ACCTCCTGAA GAGGGCCAAC 7260 

ATCACTGGGT TCGATTCCGA GACATTTATG AATGAGGCCG CCAACAACAT CTCGCAACTG 7320 

CCCAATGTCG CCATTGCCAT TTCAGGAGGC GGCTATCGTG CCCTCATGAA CGGCGCCGGC 7380 

TTCGTTGCTG CTGCGGATAA CCGAATTCAA AATACCACGG GCGCAGGTGG TATTGGAGGC 7440 

TTGTTGCAGT CCAGCACATA TTTGTATGTA AAGTGGTTCT TCTTATCTCG TTTTCGAGTG 7500 

TCAACTGCGC CAGTTCAGAG TTGGGCGGCT GTGGACGACC TTGCTGGTGA ACATGTCTTG 7560 

GACTCCATGC CCCTTCTTCG TTTCCTCAAA TCAAGAAGTC GAGGACCGTG ACCGTAAATC 7620 

GCTAACGCAA CTCTAGGGCC GGACTTTCTG GTGGTGGCTG GCTTGTCGGC AGTTTGTTCT 7680 

CCAACAACTT CAGCAGCATT GAGACCCTGC TGAGCGAGAA CAAAGTCTGG GACTTTGAGA 7740 

ACTCCATCTT TAAAGGGCCC AAGGAGGCTG GCCTTAGTAC TGTCAACCGC ATTCAGTACT 7800 

GGTCCGAAGT GGCAAAGGAA GTTGCCAAGA AGAAGGATGC TGGCTTCGAG ACAAGTATAA 7860 

CAGACTACTG GGGCCGAGCA TTGAGTTACC AACTGATCGG AGCCGATATG GGCGGCCCGG ,7920 

CTTACACCTT CTCCAGCATT GCCCAGACCG ACAACTTCCA GAAGGCCGAA ACGCCGTTCC 7980 

CTATTCTGGT AGCTGACGGC CGCGCGCCTG GAGACACCAT CATCTCCCTC AATGCTACCA 8040 

ACTACGAGTT CAACCCGTTC GAGACGGGTA GCTGGGACCC GACCGTCTAT GGCTTTGCGC 8100 

CGACCAAGTA CCTCGGCGCC AACTTCAGCA ACGGCGTGAT CCCATCGGGA GGCAAGTGCG 8160 

TTGAGGGTCT CGACCAAGCC GGCTTCGTCA TGGGCACCAG CAGCACGCTC TTCAACCAGT 8220 

TCCTTTTGGC CAACATCTCC AGCTACGACG GTGTTGCCCG ACGTGCTCAT CGAAGCCGTG 8280 

ACTTCTGTCC TCAAGGAAAT CGGCGCCAAG AGGACGACGT CTCCCAAATC ATCCCTAATC 8340 

CGTTCCTGGA CTGGAACAAC CGGACCAACC CCAACGCCGA CACGCTCGAG CTCGACCTGG 84 00 

TCGACGGCGG CGAAGATCTG CAGAATATTC CGCTCAACCC GCTCACCCAA CCCGTGCGCG 84 6 0 

CCGTGGACGT CATCTTCGCT GTCGACTCGT CCGCCGACGT GACAAACTGG CCCAATGGCA 8520 

CCGCCCTGCG AGCCACCTAC GAGCGCACTT TCGGCTCTAT TTCCAACGGG ACACTCTTCC 8580 

CCTCGATCCC CGACGACTGG ACGTTTATAA ACCTAGGCCT CAACAACCGC CCCTCTTTCT 864 0 

TCGGCTGCGA TGTTAAGAAC TTTACCTTGA ACGCCAACCA AAAGGTTCCC CCCTTAATCG 8700 

TCTATGTCCC CAACGCGCCC TATACCGCGC TGAGCAACGT GTCCACCTTC GATCCGTCAT 8760 

ACACCATGTC TCAGCGCAAC GACATCATCG GCAACGGATG GAACTCAGCC ACGCAGGGAA 882 0 

ACGGCACGCT GGATTCGGAG TGGCCCACTT GCGTCGCCTG CGCGGTTATC AGCAGGAGCT 8880 

TAGATCGGTT GGGCAGGCAG ACGCCAGCCG CGTGCAAGAC TTGCTTTGAG AGGTATTGCT 8 94 0 

GGAATGGCAC AGTGAACTCA AAAGATACAG GGGTTTACAT GCCTGAGTTC AAGATTGCGG 9000 
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ATGCGCATGC CCTGGACTCCS GGTGCTGTTG CTATCGGAAA GATGGTGAAT GTCTGGTCGT 9060 

CGGTTGTGGT GGGAGTTGTG GCGGCTACTT TGTTGTTGTA GGGGTAGGGG AGACGTGATG 912 0 

ATATTCCAGT CTGATGAAGT. TGAGACTGGA CTGGAGATCG CCAAGGATGC GGAGGGAAAG 9180 

GAATGCGTGG TGTTAATGTC ATGATGGATG AAGGGTCATG GATCATGGAA CGACGGGGCG 9240 

GGGATATTGG ATGATGGATA TACCACACTG CATGCATGCT CTATTGATAA TATGCTTTGG 93 00 

CATTTACGTT TAACAATCAA TTGCTCCATC CTGATGTTCT ATCTTTCGAC ACTGGATTGA 93 60 

TACTACTCCT GTTGCTTCCC TCTTGAAGTT GGAAGGACTT GAGGTTGGAA GGACTTGAGG 9420 

TTGTTTGTTC TGAGGGAGGT TATCGAAGTA TCATCTGTGC TGATGCCGAT CGATAGACTG 9480 

CCCTCTTCTT CGAGGCAACG AACGGTCGGA TGAGCCTCTA ATCATGATGC TCAGTGCCAC 9540 

AAAAAGGCTC CAGCACAGCT GCCCACACCT TTTTTGCCTC GTCGCTCCTT CCTTTTTTTC 9600 

CCCCCCTTTC TTCCTTTCCA TCTCATCCCG TACCAGAGTG CCCACCGGGT ATATATATTA 9660 

CCTCCTTGGC CGTTCTCCTT TGACCAATAA ATCGCTTGGT CGAGTGGCGT AACCGTTTAC 972 0 

CGTCTACACT TATCACTCAA ACCAAACCAA ACCATCGAAG AAGTTACCTA TCGGTTCGAG 9780 

GGAACGGTGA TGTTCTTACG TTCAAGTTAA CCCAAAGAGC GTTCCACATC GTTGAACCGT 984 0 

CTCCTCCAGT TCTTGGATCT GTTTAACTTC CGCAGCGACT GAAGAAGTAA TCACTTTTTT 9900 

TTTTTTTGGT TCCAAAAAAA AAAAAAAAAA TTAC 9934 
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